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ABSTRACT 

The Goldstein-Scheerer Tests of Abstract and Concrete 
Thinking are reviewed. This evaluative summary supplements several 
reviews in the Mental Measurements Yearbook series. The 
Goldstein-Scheerer tests are used by clinicians and researchers to 
distinguish normal individuals (those able to engage in two 
qualitatively different levels of cognitive behavior — concrete and 
abstract) from abnormal individuals (organically impaired as well as 
psychiatric sub-groups, capable of concrete behavior only). The 
concrete and abstract attitudes are n^t acquired or learned but are 
capacity levels of the entire personality. The eight characteristics 
of the abstract attitude are listed, as well as descriptions of the 
five sub-tests making up the individual performance test battery: (1) 
Goldstein-Scheerer Cube Test; (2) Gelb-Goldstein Color Sorting Test; 
(3) Gelb-Goldstein-Weigl-Scheerer Object Sorting Test; (4) 
Weigl-Goldstein-Scheerer Color Form Sorting Test; and (5) 
Goldste^.i-Scheerer Stick Test. Intended practical applications, 
intended clientele, and the fact that the test manual includes no 
standard scoring system are briefly discussed. A number of validity 
studies, published from 1945 to the present, are reviewed. Cost of 
the t3St is also mentioned. (MAC) 
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INTRODUCTION 



This paper is an extended review of the Goldstein-Sche erer Tests of Ah>:trarf anH Tf^ nnTi ft 
Think i ng (Jhc Psychological Corporation, 1945). Since the Goldstein-Scheerer tests currently are 
used by clinicians and researchers, this review should provide some insight into the empirical 
evidence bearing on the psychonaetric and statistical properties of the battery. As such, this 
evaluative summary supplements several reviews in the Mental Measiiremftnts Vparh^K "'"^^ 

I would like to thank Professor Robert L.Thomdike for pix)viding guidance in searching out 
some of the older literature and for clarifying my thinking on the psychometric and statistical issues 
discussed here. 

F. O'Brien 
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I. General Introduction 

The present form of the Goldstein-Scheerer Tests of Abstract and Concrete 
Thinking (Psychological Corporation, 1945) evolved from the work of Gelb and 
Goldstein (1920) in the latter's experience with brain damaged patients during and 
after World War I. Gelb and Goldstein believed that one of the main effects of brain 
damage was a loss of the ability to think abstractly C'concreteress"). Their 
experience led them to hypothesize the existence of two qualitatively different levels 
of cognitive behavior-concrete and abstract The normal individi^al is capable of 
engaging in both "attitudes", whereas the abnormal individual (organically impaired 
orginally, but later to include psychiatric sub-groups) is capable of concrete 
behavior only. The concrete and abstract attitudes are not acquired or learned 
behaviors, but are "capacity levels of the total personality" (Goldstein and Scheerer, 
1941, p. 1). The "abstract attiti de" is defined in terms of eight characteristics: 

1 . "To detach our ego from the outerworid or from inner experiences." 

2. "To assume a mcntdi set" 

3. "To account for acts to oneself: to verbalize the account." 

4. "To shift reflectively from one aspect of a situation to another." 

5. "To hold in mind simultaneously various aspects." 

6. "To grasp the essential of a given whole: to break up a given whole into parts> 

to isolate and synthesize them." 

7. "To abstract conmion properties reflectively: to form hierachic concepts " 

8. "To plan ahead ideationally: to assume an attitude towards the 'mere possible* 

and to think or perform symbolically." 

Theoretically, normal individuals possess each of these characteristics; operationally, 
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they exhibit the capacity to perform abstractly on each of the sub-tests of the 
Goldstein- Scheerer battery. Impaired individuals perform concretely on the 
sub-tests because of the inability to do 1 to 8 above. 



11. Description of the Siih-Tests 

The Goldstein-Scheerer Tests o f Ahstracr and Concrete Thinking is an 
individual performance test oattery consisting of five sub-tests which require the 
examinee to sort, classify or make designs with various objects, and for seme of the 
sub-tests, to render a verbal justif cation for a given performative response. The five 
sub-tests and a task description of each are as follows: 



a. Goldstein-Scheerer Cube Test. This sub-test basically is a modification of 
the Kohs Block Design Test The modification consists of a standard set 
of aids given to subjects unable to complete an original design (e.g., 
copying from a life-size picture of the design, from a picture divided up 
into four squares representing each block, from a model of the design, etc.) 

b. Gelb-Goldstein Color Sorting Test The second sub-test in the battery 
consists of a large number of small woolen skeins of varying hue, 
brightness and saturation. The examinee is required to sort the skeins into 
groups according to hue and brightness. In the first part of the tasks, the 
subject picks a skein and sorts others that belong with the selected skein, 
and explains the basis of the sort i" terms of color principles; in the second 
part, the subject mast sort skeins that belong with an original skein selected 
by the examiner, and explain the basis of the completed sort. 

c. Gelb-Goldstein-Weigl-Sheerer Object Sorting Test. This sub-test consists of 
over thirty everday objects. The subject is required to sort objects into 
conceptual groups (e.g., color, form, material, use, etc.) and explain the 
basis of the sort. As in the Color Sorting Test, the examinee initially selects 
an object of his choosing, sorts all objects which are believed to belong and 
ren ers an explanation for the basis of the sort.; secondly, the examinee is 
required to label the class of objects selected by the examiner. 

d. Weigl-Goldstein-Scheerer Color Form Sorting Test. This fourth sub-test 
consists of four sm.ill plastic squares, four equilateral triangles and four 
circles, each figure within a geometrical group reproduced in one of four 
colors. The examinee is asked to sort all twelve figures into groups and 
explain the principle of the sort, and then to resort all the figures in a 
different way, and again, to explain tlie principle of the sort. 



e. Goldstein-Scheerer Stick Test The last sub-test in the battery consists of 
small plastic rectangular solids of several sizes which differ in length. The 
examinee is required to reproduce patterns with the sticks from pictures and 
from a memory of the pictCKial designs presented one at a time. 

No standard scoring system is suggested in the Goldstein-Scheerer manual 
(1941) for any of the five sub-teSiS. The examiner must decide whether the 
examinee exhibits the "concrete'* or the "abstract attitude on each sub-test derived 
from examples of kinds of responses that suggest an abstract or concrete attitude. 
The overall decision of assigning an individual's sub-test responses to the abstract or 
concrete category is likewise not explicitly stated in the manual. Owning to the 
original purpose and use of the instrument, the examiner is left to assess the patient's 
performance subjectively; the individual user of the test must rely on accumulated 
clinical experience for scoring individual response behavior. Zangwill, reviewing 
the test in the 3rd MMY (1949) made this point about the Goldstein-Scheerer battery 
by comparing it to the Rorschach: 

"It is essential in both cases to gain the necessary clinical experienc: in 
ad;ninistering the tests before the results can be evaluated with any 
confidence." 

In general, although a relatively standard method of administration is described in 
the manual, Goldstein and Scheerer do not present objective standardization data of 
any sort, nor are validity indices or reliability measures presented in the manual. 

in. Intended Practical Application^} 

The authors intended the instrument to be used as a clinicpl adjunct in the 
diagnosis of brain damage. However, the instrument is used in the diagnosis and 
classification of the non-organically impaired such as schizophrenia. 



IV. Intended Clientele 

The test battery was intended originally for use with brain damaged adults, 
but the instrument has been used widely with other adult non-organically impaired 
individuals, There has been use made of some of the sub-tests for research 
purposes with children, but no evidence has been found indicating that the test has 
been used diagnostically with children. 

V. V-Jiditv Studies 

Introduction 

Establishing the validity of the Goldstein-Scheerer battery is problematical for 
several reasons. The five sub-tests of the Gcldstein-Scheerer battery represent a 
unique instrument in diagnostic psychological testing in providing no quantitative 
data in the manual on the subjects used, in providing no percentage of incorrect 
diagnoses, ipjoring possible confounding variables such as age and IQ, and in 
assuming naively that the performance of normal individuals of average intelligence 
will be without error on the sub-tests. Although it is possible that the sub-tests do 
have high validity in the hands of a skilled clinician , it appears that the validity of 
the sub-tests is rather private to the individual user. Yates (1954) has said of the 
validity of the Goldstein-Scheerer battery: 

"The ba^ic criticism to be made of these tests is not that they are invalid, but 
that there is no basis for a discussion of their validity" (p. 360) 

The validity studies that have been conducted over the years cn the 
Goldstein-Scheerer are difficult to subsume neatly; several reasons contribute to this. 
First, some investigators who have been dissatisfied with the lack of a standaixiized 
scoring system for any of the sub-tests and with the ambiguities present in the 
procedures for . mistering parts of the sub-tests, have modified both 
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administration and scoring procedures as well as supply local norms for certain 
sub-groups. Secondly, when validity studies of various kinds have been conducted, 
they have tended to be concerned with one, two or at most, three sub-tests of the 
original battery. And even in the endeaver to catalogue the validity of certai:; 
sub-tests, the results have been inconclusive and difficult to interpret. 

Studies 

Some investigators have noted that the extensive definition given of the "abstract 
attitude" may be difficult to distinguish from measures of general intellectual 
functioning. Payne and H-wlett (1960) assembled a large battery of tests in an 
attempt to discover what differences, if any, exist between normals and 
non-organically impaired individuals. They were specifically interested in the 
thought patterns of schizophrenics. Previous work had suggested that 
schizophrenics were abnormally "concrete" in thinking, whereas the present authors 
hypothesized that concreteness was primarily a function of "overinclusive thinking" 
(i.e., sorting which includes objects not belonging with the content of the -oncept, 
or a sorting whose concept is too inclusive in reference to the objects sorted). The 
battery of tests included measures of "psyc'^oticism," "general intelligence," (Mill 
Hill Vocabulary Test, Nufferr.o Level Test, and the Arithmetic and Picture 
Completion sub-tests of the W-B Intelligence Scale), "psychomotor re tardation," 
and "overinclusive thinking," and "concreteness" (five measures including the 
Object Sorting Test and the Color Form Sorting Test of the Goldstein-Schcerer 
battery). The tests were administered to 20 normals, 20 neurotics, 20 depressives 
and 20 schizophrenics who were matched for age, educational level, occupational 
status and pre-illness intellectual level (Mill Hill Vocabulary Test). Scoring 
procedures were developed for some of the sub-tests including the 
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Goldstein-Scheerer sub-tCots. The scores for the entire group were combined 
(which is questionable because the sub-groups do not come from the same parent 
population since one-way A^OVA's shewed unequal means to exist between the 
sub-groups on the majority of measures used which results in spuriously high 
correlations) and product-moment correlations were computed among all the 
measures. This intercorrelation matrix was factor analyzed using Thurstone's 
method for extracting controid factors. See the attached copy of the three unrotated 
factor loadings and rotated factor loadings. The rotated factor loadings were obtained 
according to the criterion of maximum discrimintion between sub-groups in 
accordance with hypotheses of group differences that were of interest to the 
investigators. The loadings were computed in the following manner. Discriminant 
function weights were obtained for the three unrotated centroid factors on various 
sub-groups of interest (for example, between the depressive sub-group and the 
schizophrenic sub-group, etc.). Any such set of three weights were premultiplied 
into the three centroid factor loadings on a given variable to obtain a particular 
coluLin of rotated factor loadings. The "O" factor ("overinclusiveness"), the "R" 
factor C'retardation") and the "G" factor C'general intelligence") are orthogonal 
factors. The "P' factor C'psychoticism") is an oblique factor (correlated with the "O" 
factor). It is notable that the two Goldstein-Scheerer measures of "concreteness" 
load equally as well on the "G" factor as the more conventional measures of 
intellectual level. The reversed loadings of the two Goldstein-Scheerer sub-'ests 
reflect the way they were scored: a low score meant abstract, a high score meant 
concrete. These high loadings for the Goldstein-Scheerer sub-tests tend to indicate 
that those measures are just as good measures of intellectual level as more 
conventional measures of intellectual status. The small number of subjects used in 
this study in relation to the number of variables studied is noteworthy. It is usually 
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recx)mmended as a rule of thumb that there should be approximately ten times as many 
subjects as there are variables for a factor analytic study (Harman, 1967). 

Boyd (1949) provided further evidence that intelligence may be an important factor 
in performance on some of the Goldstein-Scheerer sub-tests as well as an example that 
some the battery is not totatally resistant to quantification. Boyd devised a scoring 
system for the Cube Test by assigning scores for lovels of performance and explanation 
(a step-wise scoring system corresponding to the "helps" for tho original sub-test). 
When Boyd divided a group of 54 normal and female subjects into sub-groups according 
to levels of W-B Full Scale scores, he found that Cube Test scores varied as did IQ scores. 
He also computed upper/lower limits of Cube Test standard deviations in order to 
compare abnormals. When Boyd compared certain Cube Test scores of brain damaged 
and non-organica'ly impaired subjects with the control group "norms" he found a slight 
trend of lower Cube Test scores for the deviant sub-groups as compared with the 
corresponding normal IQ groups, but the N's are too small to draw reliable statistical 
inferences. 

McGaughran and Moran (1956) found variance in performance on the Object Sorting 
Test to be more closely associated with esti ated test intelligence and educational level 
than with psychiatric diagnosis. The authors tested two groups of individiuals. one group 
consisting of 37 male patients diagnosed as paranoid schizophrenics, the second group 
consisting of 40 male pateints in a VA hospital with non-psychiatric diagnoises; 37 of 
this latter group were matched with the 37 psychiatric patients for estimated test 
intelligence (vocabulary scores on the W-B), educational level and ape, Ths Object 
Sorting Test was administered in accordance with the Rapaport (1945) modification. 
This modification for this sub-test has been widely adopted by investigators. The changes 
made by Rapaport involve using thirty three common objects that are very ^.imilar to the 
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Goldstein-Scheerer materials in which the examinee must sort the objects under two 
conditions or phases called the "active" and the "passive" (or compliant) phases, 
both of which closely conespond to the tasks described in the Goldstein-Scheerer 
and the Rapaport modification of the Object Sorting Test resides in the exteded 
sub-divisions that Rapaport has given to the levels of "abstract" and "concrete". The 
scoring system used by McGaughran and Moran in the present study is a slight 
modificaiion of Rapaport's scoring system and was developed for the purpose of 
testing hypotheses which the Rapaport scoring system is not able to handle. Two 
methods of scoring sorting and verbalizations were employed in the present study: 
responses were first scored for "conceptual level" and then for "conceptual area". 
The Conceptual Area scoring method was deveopcd by the authors for the purpose 
of testing hypotheses concerning the nature of scliizophrenic thought patterns, which 
state that schizophrenics are not so much deficient in abstract ability as they are in 
social communication. The authors concluded that 

"The major differences be veen the schizophrenic subgroups based on 
intellegence and education ii in the relative publicness-privateness of their 
conceptualization while the major differences between the nonpsychiatric 
subgroups is along the open-closed variable." (pAl) 

The only discoverable evidence of the factorial composition of the 
Goldstein-Scheerer battery are two brief reports by Silverstein (1960) and 
Silverstein and Moran (1965) for the Object Sorting Test. Silverstein (1960) 
performed a cluster analysis on data reported by Rapaport £Lal. (1945). The data arc 
apparently levels of performance on the Object Sorting Test as scored in accordance 
with Rapaport's scoring system. Thirteen variables from the sorting test were 
selected for analysis. The subjects consisted of "54 randomly selected members of 
the Kansas Highway Patrol." The score distributions, were dichotomized at the 
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median and intercorrelated using the phi/phimax index of association. The analysis 
employed was "McQuitty's elementary linkage analysis." Three clusters were 
obtained, two of Wnich were identified as corresponding to the two major conceptual 
areas of the sorting test as described by Rapaport (1945). One is called "active 
concept formation" and corresponds to the first pan of the Goldstein-Schecrer 
sorting test in which the subject is required to select an object and sort all other 
objects that belong and give an explanation of the conceptual basis for the sort made. 
The second cluster was identified a^. corresponding to "passive concept formation," 
which corresponds essentially to the second part of the Goldstein-Scheerer 
procedure in which the subject must name the concept underlying the group of 
objects selected by the examiner The third cluster obtained corresponds to an 
explanation of the concept underlying a grouping in terms of function or everyday 
usage (functional definitions can be scored in both the ctive and passive phases of 
the Rapaport scoring system). The term also corresponds to a concrete explanation 
in the Goldstein-Scheerei sconng system (1941). 

The cluster analysis - uggests to Silver,iein that there is an absr .^ct-concrete 
continuum present m verbal conceptual t>whavior with functional definitions 
representing an inu pendent category, and not an abstract/concrete dichotomy. For 
tiais reason, Silverstein reconmiends changes in ti^e scoring systems for measures of 
verbal cc -^prehension, such as the Weschler Vocabulary sub-test, which call for 
active definitions of terms. Silverstein reconmiends giving less credit for a 
functional definition than for a concrete definition rather than the conventional 
procedure of giving credit in an abstract-functional-concrete order. 

In anotiier brief report Silverstein and Moran (1965) describe the results of a 
factcM- analysis performed io determine the relation between intelligence and 
performance on the Object Sorting Test As in the 1960 cluster analysis reported 
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Silverstein, the Rapaport modification was used to score the mai n levels of 
performance on the sorting tesL The data used were those reported by R?paport for 
three groups: normals, neurotics and schizophrenics (sample sizes are not given). 
Thirteen measures from the sorting test along with the W-B Full Scale IQ's were 
selected for analysis. All of the score distributions were dichotomized, tetrachoric 
correlations were calculated and the resulting correlation matrices, one for each of 
the three groups, were factor analyzed using principal components followed by 
orthogonal rotation. The resulting factor solutions were not in complete agreement 
with one another, although they showed structural similarities. A single factor 
corresponding to active concept formation for the normal group split into two 
factors, which were interpreted as "Sorting" and "Verbalization" for the neurotic and 
schizophrenic groups, respectively. For all three groups there was a factor 
corresponding to passive concept formation, and for each group the IQ sub-tests 
loaded highest on this factor. Lastly, a doublet occured for functional definitions 
from the the two parts of the Object Sorting Test. 

Silverstein draws three conclusions from the analyses. First, the results support 
Rapaport's distinction between active and passive concept formation which are 
involved in the two parts of the Object Sorting Test. This suggests that there is an 
abstract-concrete continuum in conceptual behavior. Secondly, the findings 
demonstrate that intelligence as measured by the W-B, at least, is more closely 
related to passive rather than active concept formation. Lastly, functional defmitions 
do not lie on the abstract-concrete continuum but are an independent entity. 

Although the Goldstein-Scheerer battery was developed principally for adults, 
some use has been made of some of the sub-tests in the study of conceptual 
development in children and adolescents. A brief sunmiary of some of the main 
fmdings of some of the studies found in the literature will now be given. 
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One of the earliest studies in this area was the expenment performed by Reichard, 
Scheider and Rapaport (1944) in which 234 normal white children, ages 4 to 14, 
were tested with the Object Sorting Test and the Color Form Sorting Test. The 
scoring system for the Object Sorting Test used in the present study is the one that 
Rapaport fiLal- (1945) later came to adopt for that sub-test. The Color ^rm Sorting 
Test was scored in terms of the number of conceptual shifts exhibited by the 
children; i.e., the shift from a color sort to a form sort of vice versa. Using a 
chi-squarc test for within age group differences, it was found that at around 7 to 8 
years of age, the ability to perform the two sorts became dominant on the Color 
Form Sorting Test. Moreover, throughout most of the age range tested, grouping in 
terms of form appears to be more probable than color grouping. The Object Sorting 
test results showed that, in the course of childhood and adolescence, the accuracy of 
active (sorting;) concept f^r^nnation develops earlier that the accuracy of definition in 
passive concept formation. The percentages of conceptual definition were higher in 
the active sorting phase than in the passive phase where the individual must explain 
the basis for an examiner made grouping, but no significance testing wa^ reported. 

Goldman and Levine (1963) in an excellent cross-sectional study found the same 
basic kind of concept formation changes to be a function of age. The investigators 
compared 8 groups of subjects (approximately 15 subjects per group) which 
represent the earliest school grades (K to grade 4), the middle school grades (6 and 
9), college students and practicing scientists. Using their own elaborate scoring 
system for the Object Sorting Test, the authors found significant mean differences 
for almost every sub-group comparison made. The overall impression one gets 
from the study is analogous to the results reported by Reichard fiUl. (1944) 
concerning active and passive concept formation. 

Heald and Marzolf (i953) employed the Stick Test and the Color Form Sorting 




Test of the Goldstein-Scheerer battery with 138 r Drmal elementary school children 
ranging in age fiom 6 to 1 1 (n*s range from 15 to 35 for the score distributions 
analyzed). The Stick Test was scored one point for each reproduction made from a 
picture or from memory of a picture. The Color Form Sorting Test was scored only 
for the percent of shifts at eac h age level. The Stick Test is apparently a very simple 
test even for children, as the range of percent correct for the age groups is from 90% 
for 6 years to 100% for the 1 1 years olds, the overall percent correct being 96%. 
No sex difterences occured. The authors present findings on the Color Form 
Sorting Test which are quite diffenrent from those reported by Reichard et ah 
(1944). The present authors found tl.at approximately 89% of the subjects in their 
experiment were able to sort the geometrical figures by both form and color, 
whereas only 56% of the subjects in the Reichard et al. study were able to sort both 
ways. Again, no sex differences were found on (his task. 

Although it would have been interesting to have seen results on the Object 
Sorting Test to be able to compare them with previous results, the authors seem to 
present fairl> good evidence that elementary school children are capable of abstract 
behavior, at least as measured by the two sub-tests used in this study. 

VI. Cost Effectivene5;s 

Statistics on cost effectiveness come from the 6th MMY. 
Total testing time: 30-60 minutes. 
Complete set of testing materials: $64.00. 
Individual sub-tests: 

a) Cube Test: $ 5.75 per set of 2 design booklets and supplementary 

manual: $4.50 per set of Kohs' blocks: $3.50 per 50 copies of 
either seperate record booklet 

b) Color Sorting Test: $14.50 per set of wool skeins: $2.80 per 50 

record booklets. 
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c) Object Sorting Test: $16.00 per set of objects: $4.20 per 50 

record booklets: $1.00 per 50 supplementary sheets. 

d) Color Form Sorting Test: $7.25 per set of blocks: $2.80 per 50 

record booklets. 

e) Stick Test: $4.25 per set of sticks and supplementary manual: 

$2.80 per 50 record booklets. 

Manual: The 1941, 156 pp. monograph is $2.25 per copy. 
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